Minimax number of strata for online stratified sampling: The case of noisy samples

نویسندگان

  • Alexandra Carpentier
  • Rémi Munos
چکیده

We consider online stratified sampling for Monte Carlo estimation of the integral of a function given a finite budget n of noisy evaluations to the function. In this paper we address the problem of choosing the best number K of strata as a function of n. A large K provides a high quality stratification where an accurate estimate of the integral of f could be computed by an optimal oracle allocation if the variances within each stratum were known. However the performance of an adaptive allocation (which does not know the variance within the strata) compared to the oracle one deteriorates with K. This defines a trade-off between the stratification quality and the pseudo-regret of an adaptive strategy. First we provide an improved pseudo-regret upper-bound of order Õ(K1/3n−4/3) for the adaptive allocation MC-UCB introduced in [1]. Then we prove a lower-bound on the pseudo-regret of same order, both in terms of K and n, up to a logarithmic factor. Finally we explain how to choose the best value of K given the budget n and deduce a tight minimax (on the class of Hölder continuous functions) optimal bound on the difference between the performance of the adaptive allocation MC-UCB, and the performance of the estimate returned by the optimal oracle strategy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Minimax Number of Strata for Online Stratified Sampling Given Noisy Samples

We consider the problem of online stratified sampling for Monte Carlo integration of a function given a finite budget of n noisy evaluations to the function. More precisely we focus on the problem of choosing the number of strata K as a function of the budget n. We provide asymptotic and finite-time results on how an oracle that has access to the function would choose the partition optimally. I...

متن کامل

Adaptive strategy for stratified Monte Carlo sampling

We consider the problem of stratified sampling for Monte Carlo integration of a random variable. We model this problem in a K-armed bandit, where the arms represent the K strata. The goal is to estimate the integral mean, that is a weighted average of the mean values of the arms. The learner is allowed to sample the variable n times, but it can decide on-line which stratum to sample next. We pr...

متن کامل

Fast balanced sampling for highly stratified population

Balanced sampling is a very efficient sampling design when the variable of interest is correlated to the auxiliary variables on which the sample is balanced. A procedure to select balanced samples in a stratified population has previously been proposed. Unfortunately, this procedure becomes very slow as the number of strata increases and it even fails to select samples for some large numbers of...

متن کامل

Stratified Median Ranked Set Sampling: Optimum and Proportional Allocations

In this paper, for the Stratified Median Ranked Set Sampling (SMRSS), proposed by Ibrahim et al. (2010), we examine the proportional and optimum sample allocations that are two well-known methods for sample allocation in stratified sampling. We show that the variances of the mean estimators of a symmetric population in SMRSS using optimum and proportional allocations to strata are smaller than ...

متن کامل

Weighted Likelihood for Semiparametric Models and Two-phase Stratified Samples, with Application to Cox Regression

Weighted likelihood, in which one solves Horvitz-Thompson or inverse probability weighted (IPW) versions of the likelihood equations, offers a simple and robust method for fitting models to two phase stratified samples. We consider semiparametric models for which solution of infinite dimensional estimating equations leads to √ N consistent and asymptotically Gaussian estimators of both Euclidea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Theor. Comput. Sci.

دوره 558  شماره 

صفحات  -

تاریخ انتشار 2014